43 research outputs found

    Quantitative analysis of mass spectrometry proteomics data : Software for improved life science

    Get PDF
    The rapid advances in life science, including the sequencing of the human genome and numerous other techiques, has given an extraordinary ability to aquire data on biological systems and human disease. Even so, drug development costs are higher than ever, while the rate of new approved treatments is historically low. A potential explanation to this discrepancy might be the difficulty of understanding the biology underlying the acquired data; the difficulty to refine the data to useful knowledge through interpretation. In this thesis the refinement of the complex data from mass spectrometry proteomics is studied. A number of new algorithms and programs are presented and demonstrated to provide increased analytical ability over previously suggested alternatives. With the higher goal of increasing the mass spectrometry laboratory scientific output, pragmatic studies were also performed, to create new set on compression algorithms for reduced storage requirement of mass spectrometry data, and also to characterize instrument stability. The final components of this thesis are the discussion of the technical and instrumental weaknesses associated with the currently employed mass spectrometry proteomics methodology, and the discussion of current lacking academical software quality and the reasons thereof. As a whole, the primary algorithms, the enabling technology, and the weakness discussions all aim to improve the current capability to perform mass spectrometry proteomics. As this technology is crucial to understand the main functional components of biology, proteins, this quest should allow better and higher quality life science data, and ultimately increase the chances of developing new treatments or diagnostics

    DIANA—algorithmic improvements for analysis of data-independent acquisition MS data

    Get PDF
    Motivation: Data independent acquisition mass spectrometry has emerged as a reproducible and sensitive alternative in quantitative proteomics, where parsing the highly complex tandem mass spectra requires dedicated algorithms. Recently, targeted data extraction was proposed as a novel analysis strategy for this type of data, but it is important to further develop these concepts to provide quality-controlled, interference-adjusted and sensitive peptide quantification. Results: We here present the algorithm DIANA and the classifier PyProphet, which are based on new probabilistic sub-scores to classify the chromatographic peaks in targeted data-independent acquisition data analysis. The algorithm is capable of providing accurate quantitative values and increased recall at a controlled false discovery rate, in a complex gold standard dataset. Importantly, we further demonstrate increased confidence gained by the use of two complementary data-independent acquisition targeted analysis algorithms, as well as increased numbers of quantified peptide precursors in complex biological samples. Availability and implementation: DIANA is implemented in scala and python and available as open source (Apache 2.0 license) or pre-compiled binaries from http://quantitativeproteomics.org/diana. PyProphet can be installed from PyPi (https://pypi.python.org/pypi/pyprophet). Supplementary information: Supplementary data are available at Bioinformatics onlin

    Moduli of Parabolic Higgs Bundles and Atiyah Algebroids

    Get PDF
    In this paper we study the geometry of the moduli space of (non-strongly) parabolic Higgs bundles over a Riemann surface with marked points. We show that this space possesses a Poisson structure, extending the one on the dual of an Atiyah algebroid over the moduli space of parabolic vector bundles. By considering the case of full flags, we get a Grothendieck-Springer resolution for all other flag types, in particular for the moduli spaces of twisted Higgs bundles, as studied by Markman and Bottacin and used in the recent work of Laumon-Ng\^o. We discuss the Hitchin system, and demonstrate that all these moduli spaces are integrable systems in the Poisson sense.Comment: 34 pages. Some small edits, corrected minor mistake in proof of lemma 2.1. Added journal referenc

    Givental symmetries of Frobenius manifolds and multi-component KP tau-functions

    Get PDF
    We establish a link between two different constructions of the action of the twisted loop group on the space of Frobenius structures. The first construction (due to Givental) describes the action of the twisted loop group on the partition functions of formal Gromov-Witten theories. The explicit formulas for the corresponding tangent action were computed by Y.-P. Lee. The second construction (due to van de Leur) describes the action of the same group on the space of Frobenius structures via the multi-component KP hierarchies. Our main theorem states that the genus zero restriction of the Y.-P. Lee formulas coincides with the tangent van de Leur action.Comment: 25 page

    Not on the edge: the syntax and pragmatics of clause-initial negation in Swedish

    Get PDF
    The possibility of topicalizing sentential negation is severely restricted in the Germanic V2-languages. In this paper, we show that negative preposing was more frequent and less restricted in earlier stages of Swedish: approx. 8 % of all occurrences of negation are clause initial in Old Swedish, compared to less than 0.5 % in present day Swedish. We propose that this change in frequency can be traced to the syntactic status of the negative element. More specifically, we argue that Old Swedish eigh 'not' may function as a syntactic head and cliticize to the finite verb in [C-0]. This possibility is not open to the XP inte 'not' in Modern Swedish. In Modern Swedish, we argue that the restrictions on negative preposing instead are related to more general pragmatic restrictions on the information expressed in [Spec,CP]: according to our hypothesis, negative preposing is licensed by contrast

    The Witten-Reshetikhin-Turaev invariant for links in finite order mapping tori I

    Get PDF
    We state Asymptotic Expansion and Growth Rate conjectures for the Witten-Reshetikhin-Turaev invariants of arbitrary framed links in 3-manifolds, and we prove these conjectures for the natural links in mapping tori of finite-order automorphisms of marked surfaces. Our approach is based upon geometric quantisation of the moduli space of parabolic bundles on the surface, which we show coincides with the construction of the Witten-Reshetikhin-Turaev invariants using conformal field theory, as was recently completed by Andersen and Ueno.Comment: 41 pages. Minor changes, to appear in Adv. Mat

    Analysis of bacterial surface interactions with mass spectrometry-based proteomics

    No full text
    Host–pathogen protein–protein interaction networks are highly complex and dynamic. In this experimental protocol we describe a method to isolate host proteins attached to the bacterial surface followed by quantitative mass spectrometry based proteomics analysis. This technique provides an overview of the host–pathogen interaction network, which can be used to guide directed perturbations of the system, and to select target of specific interest for further studies

    Improvements in Mass Spectrometry Assay Library Generation for Targeted Proteomics

    No full text
    In data-independent acquisition mass spectrometry (DIA-MS), targeted extraction of peptide signals in silico using mass spectrometry assay libraries is a successful method for the identification and quantification of proteins. However, it remains unclear if high quality assay libraries with more accurate peptide ion coordinates can improve peptide target identification rates in DIA analysis. In this study, we systematically improved and evaluated the common algorithmic steps for assay library generation and demonstrate that increased assay quality results in substantially higher identification rates of peptide targets from mouse organ protein lysates measured by DIA-MS. The introduced changes are (1) a new spectrum interpretation algorithm, (2) reapplication of segmented retention time normalization, (3) a ppm fragment mass error matching threshold, (4) usage of internal peptide fragments, and (5) a multilevel false discovery rate calculation. Taken together, these changes yielded 14-36% more identified peptide targets at 1% assay false discovery rate and are implemented in three new open source tools, Fraggle, Tramler, and Franklin, available at https://github.com/fickludd/eviltools . The improved algorithms provide ways to better utilize discovery MS data, translating to substantially increased DIA performance and ultimately better foundations for drawing biological conclusions in DIA-based experiments
    corecore